SEQUENCE LISTING 



(1) GENERAL INFORMATION 



(i) APPLICANT: 

Seidel, Christoph; Weinhues, Ursula-Henrike; 
Schmitt, Urban; Motz, Manfred; Wiedmann, Michael; 
Upmeier, Barbara; Soutscheck, Erwin 



(ii) TITLE OF INVENTION: 

Recombinant antigen from the NS3 region of the hepatitis C virus 

(iii) NUMBER OF SEQUENCES: 9 



(iv) CORRESPONDENCE ADDRESS: 



(A) 


ADDRESSEE: 


Felfe & Lynch 


(B) 


STREET : 


805 Third Avenue 


(C) 


CITY: 


New York 


(D) 


STATE : 


New York 


(E) 


COUNTRY : 


USA 


(F) 


ZIP : 


10022 



m (v) COMPUTER READABLE FORM: 

m (A) MEDIUM TYPE: 3.5 inch 1.44 Mb storage diskette 

O (B) COMPUTER: IBM PS/2 

hi (C) OPERATING SYSTEM: PC-DOS 

pi (D) SOFTWARE: Patentln Relase #1.0, 

s Version #1.25 (EPA) 

n\ (vi) CURRENT APPLICATION DATA: 

!n (A) APPLICATION NUMBER: 

X (B) FILING DATE: 

7*: (C) CLASSIFICATION: 



(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 08/511,759 

(B) FILING DATE: 7 -AUGUST- 1995y 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: p 44 28 705.4 

(B) FILING DATE: 12 AUGUST 1994 

(viii) ATTORNEY/AGENT INFORMATION 

(A) NAME: HANSON, NORMAN 

(B) REGISTRATION NUMBER: 30, 946 

(C) REFERENCE/DOCKET NUMBER: HUBR 1067.1 

(ix) TELECOMMUNICATION INFORMATION 

(A) TELEPHONE: 212-688-9200 

(B) TELEFAX: 212-838-3884 



(2) INFORMATION FOR SEP ID NO : 1: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 885 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(vi) INITIAL ORIGIN: 

(A) ORGANISM: hepatitis C virus 

(viii) POSITION IN GENOME: 

(A) CHROMOSOME SEGMENT: NS3 

(ix) CHARACTERISTICS: 

(A) NAME /KEY : CDS 

(B) LOCATION: 1..885 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1 



uklG ACC ATG ATT ACG AAT TCC CGG GGA TCC ATC ATG AAA TCC CCG GTG 48 
Sket Thr Met lie Thr Asn Ser Arg Gly Ser lie Met Lys Ser Pro Val 
m 15 10 15 

EjTC ACG GAT AAC TCC TCT CCA CCG GTA GTG CCC CAG AGC TTC CAG GTG 96 
fphe Thr Asp Asn Ser Ser Pro Pro Val Val Pro Gin Ser Phe Gin Val 
W 20 25 30 

M3CT CAC CTG CAT GCT CCC ACA GGC AGC GGC AAG AGC ACC AAG GTC CCG 144 
-HAla His Leu His Ala Pro Thr Gly Ser Gly Lys Ser Thr Lys Val Pro 
35 40 45 

GCT GCA TAC GCA GCT CAG GGC TAC AAG GTG CTA GTG CTC AAC CCT TCT 192 
Ala Ala Tyr Ala Ala Gin Gly Tyr Lys Val Leu Val Leu Asn Pro Ser 
50 55 60 

GTT GCT GCA ACA TTG GGC TTT GGT GCC TAC ATG TCC AAG GCT CAT GGG 240 
Val Ala Ala Thr Leu Gly Phe Gly Ala Tyr Met Ser Lys Ala His Gly 
65 70 75 80 

ATC GAT CCT AAC ATC AGG ACC GGG GTG AGA ACA ATT ACC ACT GGC AGC 2 88 

lie Asp Pro Asn lie Arg Thr Gly Val Arg Thr lie Thr Thr Gly Ser 

85 90 95 



CCC ATT ACG TAC TCC ACT TAC GGC AAG TTT CTT GCC GAC GGC GGG TGC 
Pro lie Thr Tyr Ser Thr Tyr Gly Lys Phe Leu Ala Asp Gly Gly Cys 
100 105 110 



336 



GCA GGG GGT GCT TAT GAC ATA ATA ATT TGT GAC GAG TGC CAC TCC ACG 384 
Ala Gly Gly Ala Tyr Asp lie lie lie Cys Asp Glu Cys His Ser Thr 
115 120 125 

GAT GCC ACA TCC ATC TTG GGC ATC GGC ACT GTC CTT GAC CAA GGA GAG 432 
Asp Ala Thr Ser lie Leu Gly lie Gly Thr Val Leu Asp Gin Gly Glu 
130 135 140 

ACT GCG GGG GCG AAA TTG GTT GTG TTC GCC ACC GCC ACC CCT CCG GGC 480 
Thr Ala Gly Ala Lys Leu Val Val Phe Ala Thr Ala Thr Pro Pro Gly 
145 150 155 160 

TCC GTC ACT GTG CCC CAT CCC AAC ATT GAG GAG GTT GCT CTA TCC ACC 52 8 

Ser Val Thr Val Pro His Pro Asn lie Glu Glu Val Ala Leu Ser Thr 

165 170 175 

ACC GGA GAG ATC CCT TTT TAC GGC AAG GCT ATC CCC CTT GAG GTA ATC 576 
Thr Gly Glu lie Pro Phe Tyr Gly Lys Ala lie Pro Leu Glu Val lie 
180 185 190 

AAG GGG GGG AGA CAT CTC ATC TTC TGT CAT TCA AAG AGG AAG TGC GAT 624 
CLys Gly Gly Arg His Leu He Phe Cys His Ser Lys Arg Lys Cys Asp 
& 195 200 205 

3pAG CTC GCC ACA AAG CTG GTC GCA ATG GGC ATC AAT GCC GTG GCC TAC 672 
ffslu Leu Ala Thr Lys Leu Val Ala Met Gly He Asn Ala Val Ala Tyr 
Q 210 215 220 

SjrAC CGC GGT CTT GAC GTG TCC GTC ATC CCG ACC AGC GGT GAT GTT GTC 720 
5 "Tyr Arg Gly Leu Asp Val Ser Val He Pro Thr Ser Gly Asp Val Val 
^225 230 235 240 

SbTC GTG GCA ACC GAC GCC CTC ATG ACC GGC TAT ACC GGC GAC TTC GAC 768 
%al Val Ala Thr Asp Ala Leu Met Thr Gly Tyr Thr Gly Asp Phe Asp 
jjU 245 250 255 

MtCG gtg ata GAC TGC AAC ACG TGT GTC ACT CAG ACA GTC GAT TTC AGC 816 
Ser Val He Asp Cys Asn Thr Cys Val Thr Gin Thr Val Asp Phe Ser 
260 265 270 

CTT GAC CCT ACC TTC ACC ATT GAG ACG ACC ACA CTT CCC CAG GAT GCT 864 
Leu Asp Pro Thr Phe Thr He Glu Thr Thr Thr Leu Pro Gin Asp Ala 
275 280 285 

GTC TCC CGC ACT CAA CGA CGG 885 
Val Ser Arg Thr Gin Arg Arg 
290 295 



(3) 



INFORMATION FOR SEP ID NO : 2 : 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 295 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Thr Met lie Thr Asn Ser Arg Gly Ser lie Met Lys Ser Pro Val 
15 10 15 

Phe Thr Asp Asn Ser Ser Pro Pro Val Val Pro Gin Ser Phe Gin Val 
20 25 30 

Ala His Leu His Ala Pro Thr Gly Ser Gly Lys Ser Thr Lys Val Pro 
35 40 45 

r=Ala Ala Tyr Ala Ala Gin Gly Tyr Lys Val Leu Val Leu Asn Pro Ser 
5 50 55 60 

"*P/al Ala Ala Thr' Leu Gly Phe Gly Ala Tyr Met Ser Lys Ala His Gly 
~ 65 70 75 80 

■sssa. 

Hfle Asp Pro Asn lie Arg Thr Gly Val Arg Thr lie Thr Thr Gly Ser 
£j 85 90 95 

*_Pro lie Thr Tyr Ser Thr Tyr Gly Lys Phe Leu Ala Asp Gly Gly Cys 
y 100 105 110 

f'fl 

PAla Gly Gly Ala Tyr Asp He He He Cys Asp Glu Cys His Ser Thr 
J3 115 120 125 

MAsp Ala Thr Ser He Leu Gly He Gly Thr Val Leu Asp Gin Gly Glu 
130 135 140 

Thr Ala Gly Ala Lys Leu Val Val Phe Ala Thr Ala Thr Pro Pro Gly 
145 150 155 160 

Ser Val Thr Val Pro His Pro Asn He Glu Glu Val Ala Leu Ser Thr 

165 170 175 

Thr Gly Glu He Pro Phe Tyr Gly Lys Ala He Pro Leu Glu Val He 
180 185 190 

* Lys Gly Gly Arg His Leu He Phe Cys His Ser Lys Arg Lys Cys Asp 
195 200 205 



Glu Leu Ala Thr Lys Leu Val Ala Met Gly He Asn Ala Val Ala Tyr 
210 215 220 

Tyr Arg Gly Leu Asp Val Ser Val He Pro Thr Ser Gly Asp Val Val 
225 230 235 240 

Val Val Ala Thr Asp Ala Leu Met Thr Gly Tyr Thr Gly Asp Phe Asp 

245 250 255 

Ser Val He Asp Cys Asn Thr Cys Val Thr Gin Thr Val Asp Phe Ser 
260 265 270 

Leu Asp Pro Thr Phe Thr He Glu Thr Thr Thr Leu Pro Gin Asp Ala 
275 280 285 



Val Ser Arg Thr Gin Arg Arg 
290 295 



^(4) INFORMATION FOR SEP ID NO : 3 : 



SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 



g^AGGGATCCA TCATGAAATC CCCGGTGTTC ACGGATAACT 




(5) INFORMATION FOR SEQ ID NO : 4: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 
GGGAAGCCTT AATTCTTACC GTCGTTGAGT GCGGGAGAC 



(6) INFORMATION FOR SEP ID NO: 



5 : 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO 
GAGGGATCCA TCATGAAAGC GGTGGACTTT ATCCCTGTG 



(7) INFORMATION FOR SEP ID NO : 6: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO 
GAGAAGCTTT TAACACGTGT TGCAGTCTAT CAC 
(8) INFORMATION FOR SEP ID NO : 7: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO 
GAGGGATCCA TCATGAAACA CCTGCATGCT CCCACCGGC 



(8) INFORMATION FOR SEP ID NO : 8: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
GAGAAGCTTT TAATACCAAG CACAGCCTGC GTC 



(3) INFORMATION FOR SEQ ID NO: 9: 

O (i) SEQUENCE CHARACTERISTICS: 

2 (A) LENGTH: 302 amino acids 

3 (B) TYPE: amino acid 
J (D) 'TOPOLOGY: linear 

SL* (ii) MOLECULE TYPE: protein * 

« (xi) SEQUENCE DESCRIPTION: SEQ ID Nf): 9: <•» ' l^nkkf^^ 

m J qJlA^ ^ ^ j ^ ~ 

l=?Pro Val Glu Asn Leu Glu Thr Thr Met Arg Ser Pro Val Phe Thr 

J! 1 5 10*' 15 

flAsp Asn Ser Ser Pro Pro Val Val Pro Gin Ser Phe Gin Val Ala 

p 20 ' 25 30 

His Leu His Ala Pro Thr Gly Ser Gly Lys Ser Thr Lys Val Pro 

35 40 45 

Ala Ala Tyr Ala Ala Gin Gly Tyr Lys Val Leu Val Leu Asn Pro 

50 55 60 

Ser Val Ala Ala Thr Leu Gly Phe Gly Ala Tyr Met Ser Lys Ala 

65 70 75 

His Gly He Asp Pro Asn He Arg Thr Gly Val Arg Thr He Thr 

80 85 90 

Thr Gly Ser Pro He Thr Tyr Ser Thr Tyr Gly Lys Phe Leu Ala 

95 100 105 



Asp Gly Gly Cys Ala Gly Gly Ala Tyr Asp He He He Cys Asp 

HO 115 120 

Glu Cys His Ser Thr Asp Ala Thr Ser He Leu Gly He Gly Thr 

125 130 135 

Val Leu Asp Gin Gly Glu Thr Ala Gly Ala Lys Leu Val Val Phe 

140 145 150 

Ala Thr Ala Thr Pro Pro Gly Ser Val Thr Val Pro His Pro Asn 

155 160 165 

He Glu Glu Val Ala Leu Ser Thr Thr Gly Glu He Pro Phe Tyr 

170 175 180 

Gly Lys Ala He Pro Leu Glu Val He Lys Gly Gly Arg His Leu 

185 190 195 

He Phe Cys His Ser Lys Arg Lys Cys Asp Glu Leu Ala Thr Lys 

200 205 210 

JLeu Val Ala Met Gly He Asn Ala Val Ala Tyr Tyr Arg Gly Leu 
1 215 220 225 

ULsp Val Ser Val Tie Pro Thr Ser Gly Asp Val Val Val Val Ala 
© 230 235 240 

ffhr Asp Ala Leu Met Thr Gly Tyr Thr Gly Asp Phe Asp Ser Val 
W 245 250 255 

"Jle Asp Cys Asn Thr Cys Val Thr Gin Thr Val Asp Phe Ser Leu 
Q 260 265 . 270 

jj}sp Pro Thr Phe Thr He Glu Thr Thr Thr Leu Pro Gin Asp Ala 
45 275 280 285 

#al Ser Arg Thr Gin Arg Arg Gly Arg Thr Gly Arg Gly Lys Pro 
*" 290 / 295 300 

Gly He 
302 



SEQUENCE PROTOCOL 



(1) GENERAL INFORMATION: 

(i) APPLICANT: 

(A) NAME: Boehringer Mannheim GmbH 

(B) ROAD: Sandhofer Str. 112-132 

(C) CITY: Mannheim 

(E) COUNTRY: Germany 

(F) POSTAL CODE: 68305 

(ii) TITLE OF APPLICATION: Recombinant antigen from 

the NS3 region of the hepatitis C virus 

(iii) NUMBER OF SEQUENCES: 8 

(iv) COMPUTER READABLE FORM: 

(A) DATA CARRIER: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, 

Version #1.25 (EPA) 

(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 885 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND FORM: both 

(D) TOPOLOGY: linear 



(ii) TYPE OF MOLECULE: cDNA 



INITIAL ORIGIN: 

(A) ORGANISM: hepatitis C virus 



(viii) POSITION IN THE GENOME: 

(A) CHROMOSOME/ SEGMENT: NS3 

(ix) CHARACTERISTICS : 

(A) NAME/ KEY: CDS 

(B) LOCATION: 1..885 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 



ATG ACC ATG ATT ACG AAT TCC CGG GGA TCC ATC ATG AAA TCC CCG GTG 4 8 

Met Thr Met lie Thr Asn Ser Arg Gly Ser lie Met Lys Ser Pro Val 
to l 5 10 15 

®C ACG GAT AAC TCC TCT CCA CCG GTA GTG CCC CAG AGC TTC CAG GTG 96 
- fhe Thr Asp Asn Ser Ser Pro Pro Val Val Pro Gin Ser Phe Gin Val 
0] 20 25 30 

jpT CAC CTG CAT GCT CCC ACA GGC AGC GGC AAG AGC ACC AAG GTC CCG 14 4 

Ala His Leu His Ala Pro Thr Gly Ser Gly Lys Ser Thr Lys Val Pro 
^ 35 40 45 

&T GCA TAC GCA GCT CAG GGC TAC AAG GTG' CTA GTG CTC AAC CCT TCT 192 
lia Ala Tyr Ala Ala Gin Gly Tyr Lys Val Leu Val Leu Asn Pro Ser 

fU 50 55 60 

©T GCT GCA ACA TTG GGC TTT GGT GCC TAC ATG TCC AAG GCT CAT GGG 240 
Val Ala Ala Thr Leu Gly Phe Gly Ala Tyr Met Ser Lys Ala His Gly 
65 70 75 80 

ATC GAT CCT AAC ATC AGG ACC GGG GTG AGA ACA ATT ACC ACT GGC AGC 288 



He Asp Pro Asn He Arg Thr Gly Val Arg Thr He Thr Thr Gly Ser 

85 90 95 

CCC ATT ACG TAC TCC ACT TAC GGC AAG TTT CTT GCC GAC GGC GGG TGC 
Pro He Thr Tyr Ser Thr Tyr Gly Lys Phe Leu Ala Asp Gly Gly Cys 
100 105 110 

GCA GGG GGT GCT TAT GAC ATA ATA ATT TGT GAC GAG TGC CAC TCC ACG 
Ala Gly Gly Ala Tyr Asp He He He Cys Asp Glu Cys His Ser Thr 
115 120 125 

GAT GCC ACA TCC ATC TTG GGC ATC GGC ACT GTC CTT GAC CAA GGA GAG 
Asp Ala Thr Ser He Leu Gly He Gly Thr Val Leu Asp Gin Gly Glu 
130 135 140 



ACT GCG GGG GCG AAA TTG GIT GTG TTC GCC ACC GCC ACC CCT CCG GGC 480 
Thr Ala Gly Ala Lys Leu Val Val Phe Ala Thr Ala Thr Pro Pro Gly 
145 150 155 160 

TCC GTC ACT GTG CCC CAT CCC AAC ATT GAG GAG GTT GCT CTA TCC ACC 528 
Ser Val Thr Val Pro His Pro Asn lie Glu Glu Val Ala Leu Ser Thr 

165 170 175 

ACC GGA GAG ATC CCT TTT TAC GGC AAG GCT ATC CCC CTT GAG GTA ATC 576 
Thr Gly Glu lie Pro Phe Tyr Gly Lys Ala lie Pro Leu Glu Val lie 
180 185 190 

AAG GGG GGG AG A CAT CTC ATC TTC TGT CAT TCA AAG AGG AAG TGC GAT 624 
Lys Gly Gly Arg His Leu lie Phe Cys His Ser Lys Arg Lys Cys Asp 
195 200 205 

GAG CTC GCC ACA AAG CTG GTC GCA ATG GGC ATC AAT GCC GTG GCC TAC 672 
Glu Leu Ala Thr Lys Leu Val Ala Met Gly lie Asn Ala Val Ala Tyr 
210 215 220 

TAC CGC GGT CTT GAC GTG TCC GTC ATC CCG ACC AGC GGT GAT GTT GTC 720 
Tyr Arg Gly Leu Asp Val Ser Val lie Pro Thr Ser Gly Asp Val Val 
225 230 235 „ 240 

GTC GTG GCA ACC GAC GCC CTC ATG ACC GGC TAT ACC GGC GAC TTC GAC 7 68 

VS1 Val Ala Thr Asp Ala Leu Met Thr Gly Tyr Thr Gly Asp Phe Asp 
ifl 245 250 255 

Tib GTG ATA GAC TGC AAC ACG TGT GTC ACT CAG ACA GTC GAT TTC AGC 816 

Sir Val lie Asp Cys Asn Thr Cys Val Thr Gin Thr Val Asp Phe Ser 
:i; 260 265 270 

CS#T GAC CCT ACC TTC ACC ATT GAG ACG ACC ACA CTT CCC CAG GAT GCT 8 64 

Mu Asp Pro Thr Phe Thr lie Glu Thr Thr Thr Leu Pro Gin Asp Ala 
275 280 285 



TCC CGC ACT CAA CGA CGG 885 

%1 Ser Arg Thr Gin Arg Arg 

[ft 290 295 



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 295 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) TYPE OF MOLECULE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

/ZO~7 

Met Thr Met lie Thr Asn Ser Arg Gly Ser He Met Lys Ser Pro 



Val 



10 



15 



Phe Thr Asp Asn Ser Ser Pro Pro Val Val Pro 
20 25 

Ala His Leu His Ala Pro Thr Gly Ser Gly Lys 
35 40 

Ala Ala Tyr Ala Ala Gin Gly Tyr Lys Val Leu 
50 55 

Val Ala Ala Thr Leu Gly Phe Gly Ala Tyr Met 
65 70 75 

lie Asp Pro Asn lie Arg Thr Gly Val Arg Thr 

85 90 

Pro lie Thr Tyr Ser Thr Tyr Gly Lys Phe Leu 
100 105 

Ala Gly Gly Ala Tyr Asp lie lie lie Cys Asp 
115 120 



Gin Ser Phe Gin Val 
30 

Ser Thr Lys Val Pro 
45 

Val Leu Asn Pro Ser 
60 

Ser Lys Ala His Gly 

80 

lie Thr Thr Gly Ser 
95 

Ala Asp Gly Gly Cys 
110 

Glu Cys His Ser Thr 
125 



Asp Ala Thr Ser lie Leu Gly lie Gly Thr Val 
130 135 



Leu Asp Gin Gly Glu 
140 



Thr Ala Gly Ala Lys Leu Val Val Phe Ala Thr 
145 150 155 

Ser Val Thr Val Pro His Pro Asn He Glu Glu 

165 170 

Thr Gly Glu He Pro Phe Tyr Gly Lys Ala He 
180 185 



Ala Thr Pro Pro Gly 

160 

Val Ala Leu Ser Thr 
175 

Pro Leu Glu Val He 
190 



Lys Gly Gly Arg His Leu He Phe Cys His Ser 
195 200 

Glu Leu Ala Thr Lys Leu Val Ala Met Gly He 
210 215 

Tyr Arg Gly Leu Asp Val Ser Val He Pro Thr 
225 230 235 

Val Val Ala Thr Asp Ala Leu Met Thr Gly Tyr 

245 250 

Ser Val He Asp Cys Asn Thr Cys Val Thr Gin 

260 265 

Leu Asp Pro Thr Phe Thr He Glu Thr Thr Thr 
275 280 

Val Ser Arg Thr Gin Arg Arg 
290 295 



Lys Arg Lys Cys Asp 
205 

Asn Ala Val Ala Tyr 
220 

Ser Gly Asp Val Val 

240 

Thr Gly Asp Phe Asp 
255 

Thr Val Asp Phe Ser 

270 

Leu Pro Gin Asp Ala 
285 



(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND FORM: single 

(D) TOPOLOGY: linear 

(ii) TYPE OF MOLECULE: cDNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 

AAGGGATCCA TCATGAAATC CCCGGTGTTC ACGGATAACT 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND FORM: single 

(D) TOPOLOGY: linear 

(ii) TYPE OF MOLECULE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 

GGGAAGCCTT AATTCTTACC GTCGTTGAGT GCGGGAGAC 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND FORM: single 



(D) TOPOLOGY: linear 



(ii) TYPE OF MOLECULE: cDNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5 

GAGG6ATCCA TCATGAAAGC GGTGGACTTT ATCCCTGTG 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND FORM: single 

(D) TOPOLOGY: linear 

(ii) TYPE OF MOLECULE: cDNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6 

GAGAAGCTTT TAACACGTGT TGCAGTCTAT CAC 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND FORM: single 

(D) TOPOLOGY: linear 

(ii) TYPE OF MOLECULE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7 
GAGGGATCCA TCATGAAACA CCTGCATGCT CCCACCGGC 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND FORM: single 

(D) TOPOLOGY: linear 

(ii) TYPE OF MOLECULE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8 

GAGAAGCTTT TAATACCAAG CACAGCCTGC GTC 



